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The present application is related to the application of Allan W. Ristow and Thonnas B. Hawkins for 
"Method and Apparatus for Font Data Storage and Retrieval", filed simultaneously herewith and assigned to 
the same assignee. 

The field of the invention is the storage, object manipulation, and image processing for output 
5 presentation of graphic objects, and more particularly, of text graphic objects in digital typography. 
Representation of typographic data in digital form involves two problems which are well known in the art. 

The first of these problems is the toss of detail accuracy which must inevitably occur in the transfer of 
the (analog) "master" design created by a typeface design artist to digital form. Quantization effects, even 
at very high scanning resolutions, provide visual distortion of otherwise smoothly flowing lines. As a result, 

70 the close approximation of the "master" by a dataset of originally-scanned data requires a very large 
amount of storage. In the earliest forms of digital typography, for example, storage and display methods 
were based upon bitmaps, which are simple to store, retrieve, and display. This early methodology, 
however, consumes a significant amount of storage space if many character (point) sizes or output 
resolutions are to be supported by the character output device. 

15 An alternate approach is to convert the originally-scanned quantized data into a connected set of 
mathematically-defined boundaries of two-dimensional regions representing the "inside" or "outside" of a 
character. The boundaries are comprised of "curve elements" which, in the most primitive implementations, 
are simple line segments. fVlore sophisticated systems use more complex shapes such as quadratic or 
cubic Bezier curves. All of these systems have the objective of storage of these mathematical shapes, and 

20 reconstruction of the digital representations of the characters through properly constructed physical 
mechanisms or computer instructions. A major benefit to the mathematical description is the ability to 
render output characters at any size by scaling the coordinate information stored in the character data 
descriptions, thereby eliminating the need for storage of bitmap data at every desired point size and/or 
output resolution. An added benefit of a mathematical description is the ability to subdivide the original 

25 outline into as many "curve elements" as necessary and to store the data representing them at whatever 
precision is required for replication of an original "master" design to the degree permitted by the scanning 
resolution. Once again, however, the mere closely the outline representation of the scanned data approxi^ 
mates the original design, the larger the dataset describing the outline. This is referred to as the "high 
resolution" problem with digital representation of typographic data. 

30 The second problem well known in the art of digital typography is the "low resolution" problem, in 
which the quantization of an analog design causes visual distortion and other effects resulting from 
misalignment with the two-dimensional raster "grid" on which the result is to be rendered for display or 
recording on an output medium. At lower resolutions such as those used for current "desktop" printing and 
video display devices, simple scaling of the character outlines produces unacceptable results. The problem 

35 is manifested in the scaling of mathematically-defined character outlines to low point sizes relative to the 
raster grid. Typical misalignment effects include loss of symmetry or "stem dropout" within individual 
characters, different appearance of supposedly identical character shapes due to differing phase with 
respect to the grid, and misalignment of a (horizontal) line of text or of a (vertical) column of characters. 
The "low resolution" problem of digital typography has been addressed by the methodology hereinafter 

40 referred to as "hint-based scaling", and is well described in Hawkins US 4,675.830 (herein incorporated by 
reference). The process developed in the Hawkins patent, along with others aimed at similar objectives, 
over-rides the normal scaling process and repositions many of the points of the character outline 
boundaries (thereby causing some pixels which would otherwise be "off' to be turned "on" and vise versa). 
This repositioning is meant to produce a more pleasing and harmonious collection of text characters at any 

-^5 given size and output resolution. The process is based upon data added to the character outline data, 
referred to in the art as "hint data" which establishes standard relationships globally defined for a given 
typeface design, along with those within individual characters, and uses them to adjust the shape of the 
contour of each character to fit the output grid as it is rendered. A successful implementation of this 
methodology is the INTELLIFONT scaleable font database offered by Agfa Division of Miles Inc. ("INTEL- 

50 LIFONT" is a trademark of Miles Inc.). Variations on the technique can be found in the POSTSCRIPT "Type 
1" font scaling technology offered by Adobe Systems Inc. ("POSTSCRIPT" is a trademark of Adobe 
Systems Inc.), and the TRUETYPE font system offered by Apple Computer. Inc., and Microsoft, Inc. 
("TRUETYPE" is a trademark of Apple Computer, Inc.). These technologies have in common the fact that 
additional data is included for every scaleable font, and for every character of a scaleable font, to provide 

55 the hint data. Moreover, a considerable amount of effort is spent in designing and applying this hint data to 
the character outlines meant for general use in computing systems. 

From the above discussion it is clear that faithful representation of digital typographic data in both the 
"high resolution" and "low resolution" domains requires a significant amount of data, since both detailed 
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outline information and hint data are present. The original purpose of providing a scaleable representation of 
text graphic objects, namely the reduction in storage requirement, has been compromised by the need to 
include additional data to modify the scaling process at commercially-required output resolutions. The fact 
that rendering speed is a major requirement for efficient display and output involving text means that the 

5 data storage vs. runtime computation tradeoff tends towards increasing the storage requirement in order to 
minimize runtime computation. The above considerations, together with the demand for a larger variety of 
typeface designs in everyday applications such as desktop publishing point to an ever increasing storage 
requirement for font data. A typical scaleable font stored in the format of the INTELLIFONT system requires 
40 - 70 Kbytes of storage. For an environment using 200 scaleable fonts, a requirement of 8 - 14 MBytes is 

10 indicated for data storage only. Accordingly, the need exists for efficient font storage and retrieval 
mechanisms which minimize the data storage requirement, supporting high-speed scaling, manipulation and 
rasterization, and at the same time meeting the quality requirements of both "high resolution" and "low 
resolution" domains. 

Methods for compression of font data sets are well known in the prior art. To place them in perspective 
75 and to understand better the objectives and features of the invention, the information content of typographic 
data is reviewed, along with the means by which data compaction can be accomplished with minimal loss 
of required information. 

The outline of a character in a typeface design represents two distinct forms of information conveyed to 
a human viewing the character on a display device or output medium. The first is the typeface design- 
ee independent character "glyph", which distinguishes a letter "A" for example, from a letter "B". The second 
form of information is the typeface design-dependent "style" of the character, which distinguishes a letter 
"A" produced in the TIMES typeface design from one produced in HELEVETICA ("TIMES" and "HEL- 
VETICA" are trademarks of Linotype AG and/or its subsidiaries). In hint-based scaling, both forms of 
information are used, the first to ensure that the character glyph is recognizable, and the second to ensure 
25 that a consistent typeface design "look" is achieved- There is in general substantial redundancy contained 
within the outline of a character, since it Is the essence of good type design that many features 
distinguishing a typeface design are invariant from character to character. In the case of digitally-rendered 
text, the only additional information conveyed is the aesthetic appearance of characters optimally aligned to 
the grid of the output medium. 
30 Content-based data compaction involves removing (or minimizing) redundancy, as opposed to non- 
content based methods which recode or reformat data. Methods for content-based compaction of typo- 
graphic data fall into three categories: 

1. Using full outline character descriptions to infer hints using a rule-based procedure, and applying 
these inferred hints to a scaling process; 
35 2. Using typeface design-dependent hints only, reconstructing the outline using a standard predeter- 
mined set of character glyphs; and, 

3. Factoring the typeface design-dependent and typeface design-independent information and retaining 
only a minimum set to represent both. 

Methods for data compression by content compaction have been available for years, differing in the 
^0 ways in which the redundant and non-redundant information are stored. An early electronic example from 
the I960's is the "Nixie tube" and later embodied in LCD displays, in which any displayable graphic is 
made up of a combination of pre-defined "strokes", each of which is present or not (i.e., "on" or "off" in the 
Nixie tube), thereby allowing any possible graphic to be represented as a binary number of order n where n 
ts the number of available "strokes". Since the set of strokes is fixed, such a system provides at most 2" 
^5 unique character glyphs, effectively representing only a single "font". 

Another example can be found in the "turtle" graphics of the MIT Logo Laboratory, in which a complete 
complex outline is described as a sequence of primitive pen-based operations tracing a line. This approach 
was extended in the POSTSCRIPT rendering of graphic shapes. In these methods, a large descriptive 
dataset is replaced by a procedural description, i.e.. a "program" which will, when executed, generate the 
50 outline. 

Two current commercial examples of data compression by content compaction are the INFINIFONT 
and PANOSE products offered by ElseWare Corp.. Seattle. WA. and the FONTCHAMELEON product, 
offered by Ares Software, Foster City. CA. For a description of these products, see "The Hardcopy 
Observer", May 1993. pp. 17-22. and "Seybold Report on Desktop Publishing", Volume 8 Number 2 

55 October 4, 1993. 

The original PANOSE system was developed in the mid 1980's as an aid to font classification for the 
purpose of font substitution. The original system classified 240 typeface designs in terms of a 7-digit 
"PANOSE NUMBER" with each digit describing one visual typeface design characteristic (e.g.., the kind of 
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serifs, position mid lines, etc.). The system was later expanded to the present form representing quantita- 
tively a total of 36 characteristics of a typeface design. The information content of the expanded PANOSE 
NUMBER provided the means for font substitution by matching PANOSE NUMBERS. In addition, the 
system provided for the dynamic construction of fonts using generic "stick figure" glyphs for each character 

5 as the starting point and using the 36 parameters for determining stem weight, serif shape, curvature, etc. 
The method was further expanded as the INFtNIFONT product, to provide for reconstruction of character 
outlines approximating generic typeface designs. 

The heart of INFINIFONT is a set of software modules called a "TERAFONT". A TERAFONT 
comprises, as an interpreted "program", a description applicable to a large number of similar typeface 

10 designs, for example, those of Latin-based alphabets. With only a PANOSE NUMBER for a given typeface 
design, execution of the TERAFONT at runtime produces a font meeting the generic specification. Addition 
of typeface design and character-specific information to the 36 parameters of the PANOSE NUMBER 
permits an approximation of an actual typeface design (including scaling hint information if available) with 
accuracy limited only by the size of the additional data supplied. 

15 The FONTCHAMELEON product incorporates one or more "master fonts" and more than 200 typeface 
design descriptors that reshape the master font to simulate popular typeface designs such as TIMES and 
HELVETICA. Each master font comprises outlines containing as control points all those required to define 
any of the typeface design style variations supported by the master. For Latin alphabets, at least two 
master fonts are required: one for non-italic and the other for italic styles. The individual typeface design 

20 descriptors define only those required for a given typeface, leaving the remainder unused as redundant 
points. Each master font requires about 200 KB of storage, and each typeface design descriptor uses about 
3 Kbytes of disk space. 

Still another approach to compaction and dynamic font construction is the linear interpolation of 
previously stored character forms as implemented in the IKARUS typeface design system ("IKARUS" is a 

25 trademark of URW, AG) and as disclosed in Markoff and Deubert US 4.933,866. In this approach, two forms 
(e.g., one designated "expanded" and the other "narrow") are pre-stored as pre-defined sets of contour 
points with differing actual locations in the plane cf the output medium. A linear interpolation procedure, 
carried out point-by-point creates a new outline which lies between the two extremes. This procedure allows 
a theoretically infinite number of new fonts to be created from two originally stored, using only a single 

30 scaling parameter given for each coordinate direction. As a result, the method provides flexible run-time 
creation of new fonts, but only as linear combinations of those already stored. 

The above methods in the prior art have in common the compression of font data by content 
information compaction to a varying degree. The INFINIFONT and FONTCHAMELEON systems both 
replace individual typeface design-dependent character outlines with one or more master outline data or 

35 procedural descriptions from which approximations of character outlines are constructed on-the-fly by 
typeface design-and character-dependent descriptors. Use of the minimum descriptor information, e.g.. the 
PANOSE NUMBER, results in a generic approximation of a typeface design, while still requiring the master 
outline in the form of the TERAFONT. In the method of Markoff and Deubert. two fixed sets of outline data 
are used to create a third. In each method, at least one outline structure is required in addition to supporting 

40 data in order to achieve faithful character 'reconstruction. Since the outline requires significant storage, the 
degree of compaction is limited by the number of fonts which can successfully be described by the same 
outline information. 

A second area of commonality of the prior art methods, including the INTELLIFONT scaleable font 
database methodology and those described above, is the redundancy of data stored for representation of 

45 both the outlines (or their content compacted equivalent representation) and the hint data which indicate 
where the boundaries should really be. Even "auto-hinting" methods aimed at construction of hint data from 
given outline data make the assunription that there exists a substantial information overlap between the 
outlines and the hints. These auto-hinting methods, although reducing the storage needed for the added 
hint data, still require the full definition of a character outline. The reverse approach, i.e., the use of a hint 

50 data description as the basis both for reconstruction of a character outline and for hint-based scaling, is 
unknown in the art. 

It is accordingly a general object of the invention to store and retrieve digital data representing graphic 
objects in forms usable for display or recording on an output medium at any resolution and object size. 

It is a specific object of the invention to store and retrieve digital typographic data in scaleable form 
55 suitable for generation of pixelmaps fit to the grid of an output medium, at any resolution and point size. 

It Is a further specific object of the invention to store and retrieve digital typographic data in scaleable 
form suitable for use as hinted character outlines for reproduction at any point size with grid alignment to an 
output medium. 
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It is a still further specific object of the invention to maintain digital typographic data in a form which 
preserves the distinguishing features of each character and each typeface design while minimizing the size 
of the stored dataset in the data storage medium. 

It is a general feature of the invention that the time required for processing of the stored digital 
5 typographic data to a form suitable for display or output recording is comparable to or less than that 
required for that of other methods known in the art. 

It is a specific feature of the invention that the time required to produce a single character bitmap from 
the stored form is faster than producing the same bitmap from an outline font stored in prior art formats. 
It is a further specific feature of the invention that characters having an arbitrary shape can be added to 
10 a font collection by straightforward data entry procedures. 

It is a still further specific feature of the invention that the visual aspects of a typeface design can be 
reproduced to a high level of quality without compromise of the minimal storage or processing time 
objectives. 

It is another further specific feature of the . invention that the storage and retrieval process is 
independent of any specific font scaling technology, provided that the information content of the font 
database is at least sufficient for the level of reproduction accuracy required. 

It is still another further specific feature of the invention that new fonts can be synthesized from an 
existing font database by systematic modification of the parameters associated with one or more typeface 
designs. 

The present invention is a method and apparatus for data compression of digital data representing 
graphic objects, and is particularly applicable to digital typographic data due to the structural inter- 
relationships associated with graphic objects in typography. The method is based upon the recognition that 
there are two data sets which independently describe the outline of a character in a scaleable font: the 
outline points themselves; and the hint data which is used to determine where the outline points end up 
25 after scaling and pixelmap grid-alignment. Because much of the same information is contained in both 
datasets, the outline data set is replaced by a data set which more closely resembles the hint data than that 
of the outlines. 

In the method of the invention, a "character model" is abstracted from the data for an actual character 
by separating the information describing the actual contour outline from the "associations" between pairs of 
specially-selected contour outline points. Rather than attaching an actual (typeface design-dependent) 
measurement, i.e., dimension, to each such association, an abstract form of the association is created by 
assigning a (character-dependent, but typeface design-independent) dimension name such as "horizontal 
cap stem" or "vertical curve stem". Dimensional data applicable to an entire collection of characters for a 
given typeface design can be stored once for the superset of abstract dimensions applicable to the 
typeface design instead of for each character of the collection, due to the fact that a horizontal cap stem for 
any character of a given typeface design has similar physical dimensions when rendered in the output 
medium. 

In addition to the character model capturing the associations, parametric descriptions are created for 
each contour segment, i.e.. each portion of the character outline extending between adjacent pairs of 
^0 reference outline points, called "segment end-points", that are linked by the associations. These parametric 
descriptions, referred to as "segment descriptors", indicate the form of a contour segment (line, quadratic 
Bezier curve, Bezier curve, etc.) and include the minimum number of reference outline points necessary to 
reconstruct it accurately given the actual positions of the segment endpoints when rendered in the output 
medium. 

^5 The character model forms a set of "shapeless" contour loops. This set is analogous to a network of 
knotted strings, with the "knots" being the segment endpoints, and the "strings" connecting them 
representing the segment descriptors. The shapeless network takes on the actual shape of a character 
when, and only when, the positions of the "knots" are defined (by assigning actual typeface design- 
dependent dimensions to the distances between them), and the "strings" connecting them are laid down 

50 according to the segment descriptors and thus made "rigid". Accordingly, the data for a complete character 
description for any given typeface design is separated into three subsets: 

1. A character model which captures the associations applicable to the equivalent characters of a large 
number of similar typeface designs; 

2. Measurements for each of the dimensions defined for the associations within the character model- 

55 and, 

3. Segment descriptors for the contour segments connecting adjacent segment endpoints. 

Of the above subsets the first is stored for each character only once for a single collection of "similar- 
typeface designs. The data in the remaining two subsets is stored either once globally if it pertains to a 
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number of typeface designs, or within the "character local data" if it is unique to a specific character. 

The most general procedure for reconstruction of a character outline for a particular font stored 
according to the above character data description involves three steps: 

1. The dimensions are applied to the associations within the character model, thereby obtaining the 
5 actual locations of the segment endpoints; 

2. The segment descriptors, together with the segment endpoints determined in step 1, are used to 
reconstruct the actual contour segment between each successive pair of segment endpoints making up 
the character outline; and, 

3. The reconstructed contour segments are used for output of the complete character outline, either as a 
7 0 rasterized grid-aligned pixelmap or in another form usable in digital typography. 

The above processing steps are similar to those occurring in a typical hint-based scaling process. 
However, in this case they are being used, without the necessity of added computation time, for three 
objectives: (1) decompression; (2) scaling; and, (3) grid-alignment if desired. 

Finally, since the dimension and segment descriptor data subsets can be stored separately, it is 
'5 possible to manipulate them individually or in combination to effectively synthesize new scaleable fonts 
based on those originally stored. 

The method differs from those of the prior art in several important ways. First. NO master outline 
dataset is required in addition to typeface design-specific character descriptions. The second distinction is 
that reproduction of any character can be achieved to meet industry standards for faithful reproduction of 
20 the typeface design, without approximation or necessary addition of a significant amount of character- 
specific data. The third distinction is that characters having an arbitrary shape can be added to a font 
collection without revision of existing data structures or processing systems. 

The objects and features of the invention will best be understood from a detailed description of a 
preferred embodiment thereof selected for the purposes of illustration and shown in the accompanying 
25 drawings: 

FIG. 1 shows a system context diagram for the preferred embodiment of the invention; 

FIG. 2A and FIG. 2B illustrate defined terms used in the detailed description of the invention; 

FIG, 3 shows a block diagram of the data flow and processing sequence of the digital output subsystem; 

FIG. 4 shows a block diagram of data flow and processing sequence of the character generator module; 
30 FIG. 5 shows the data flow in the conversion of a text character to grid-aligned scaled contour segments; 

FIG. 6 shows a character model example given for the letter "R" in three different typeface designs; 

FIG. 7 shows a detailed view of the horizontal elements of the character model of FIG. 6; 

■FIG. 8 shows a detailed view of the vertical elements of the character model of FIG. 6; 

FIG. 9 shows representative normalized contour segment types with illustration of the parameters used 
35 for their characterization; 

FIG. 10A and FIG. 10B provide an example of reconstruction of an actual outline segment for a given 

segment parameter set; 

FIG. 11 compares the printed results obtained using the method of the invention and a reference 
scaleable font data storage method for two typeface designs; 
o FIG. 12 shows the result of character outline reconstruction for the letter "H" for 35 typeface designs 
using a set of 12 models; and. 

FIG. 13 shows a data processing flowchart for construction of a character database according to the 
method of the invention. 

The fact that character outlines can be completely reconstructed without the use of a master outline is 
45 based upon the process of "factoring" of redundant information. This can best be understood by 
considering the information contained in a character to be factored as the "union" of character "glyph" 
shape information, dimensions of standard parts such as stems, caps, etc.. physical associations between 
portions of a character which are preserved over a range of point sizes and resolutions, and outline 
topology considered as a sequence of curve segments. Redundancy occurs within a group of typeface 
50 designs when all instances of the same character glyph shape differ only by one or mere of the other 
information components. Similarly, redundancy occurs when a group of characters belonging to the same 
typeface design have the same standard dimensions and differ only by glyph shape and outline topology. In 
these cases, the redundant components can be factored out and made separate from the information which 
is represented by data stored for individual characters. Several examples exist for the factoring of 
£5 information: 

1. Dimensions can be factored out of character models and made font global (as done presently in 
INTELLIFONT, POSTSCRIPT. TRUETYPE); 
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2. Associations can be factored across many typeface designs for a single character glyph shape. For 
example, many instances of the letter "H" have the same basic model, but different dimensions; 

3. Outline topology can be factored across many characters just as associations; 

4. Segment endpoints can be recovered from associations and dimensions provided that the association 
5 structure is rich enough. Some segment endpoints need to be described by data which is specific to the 

character to which it applies; and, 

5. The factoring of curve segments into endpoints and curve descriptions allows the essence of the 
curve to be described in a compact form. The actual curve can be derived from this curve description 
and the endpoints. This approach is successful because, for line segments and simple curve elements 

w the endpoints form a significant amount of the information needed to describe the curve segment. These 
curve descriptions provide further information factoring In the form of a single curve description 
applicable to many actual curve segments. 

Turning now to the drawings, an implementation of a computer system using the method and apparatus 
of the invention will be described in detail. FIG. 1 shows a typical application context of this system in 
;5 which a user (110) uses a keyboard (120) to access a computer (100) containing a computer program 
(130). The computer program (130) can be a process which reads from or writes to a data store (140) and 
can operate with varying degrees of user interaction, including no interaction after startup. In the typical 
application environment shown in FIG. 1, the computer program (130) can generate output (150) in the 
form of text and graphics for display on a video display monitor (170). as well as for output to a device 
20 (185) (hereinafter referred to as a "printer") for printing or recording on an output medium (190) such as 
paper or film. The output (150) of the computer program (130) is encoded as a sequence of digital signals 
which are converted into electrical signals for -output to a physical output device. This conversion is carried 
out by a digital output subsystem (160), in which the signals (150) are converted Into a form (165) for 
output to display monitor (170) or a form (175) for output to the printer (185). 
25 As an aid to understanding the discussion to follow, the terms defined herein apply to the following text 
and claims, and are indicated by small capitalization. A CHARACTER is a graphic object conveying 
information such as typographical information, having a visually recognizable appearance A TYPEFACE 
DESIGN is a specific set of CHARACTERS originally produced by an artist, conveying a particular visual 
style which is shared by all CHARACTERS of the same TYPEFACE DESIGN. A FONT Is a collection of 
30 CHARACTERS having the same TYPEFACE DESIGN and point size (e.g., "12 point TIMES-ROMAN Bold 
Italic ). A SCALEABLE FONT is the data representation of FONTS of the same TYPEFACE DESIGN such 
that CHARACTERS can be obtained from the same data over a range of point sizes A TYPEFACE 
comprises the set of TYPEFACE DESIGNS having common design elements, eg the TYPEFACE 
HELVETICA" comprises the TYPEFACE DESIGNS "HELVETICA", "HELVETICA Bold" "HELVETICA 
35 Italic", "HELVETICA Narrow" etc. 

A STORED CHARACTER is a data representation of all information used to process and display a 
single CHARACTER at a single resolution and point size. A STORED SCALEABLE CHARACTER is a data 
representation of all information used to process and display a single CHARACTER over a range of point 
sizes and resolution values according to a given scaling technology. A FONT DATABASE is a plurality of 
40 STORED CHARACTERS. A SCALEABLE FONT DATABASE is a plurality of STORED SCALEABLE 
CHARACTERS. A CHARACTER SET is a set of identification codes according to a number system (such as 
Unicode numbers) each of which uniquely identifies a recognizable CHARACTER in a TYPEFACE DESIGN- 
independent and style-independent manner. The CHARACTER CODE is the unique identification code of a 
single character within a CHARACTER SET. 
-5 The following defined terms are illustrated in FIG. 2A. An OUTLINE ELEMENT (200) is a single straight 
or curved line segment that is used to define a portion of the boundary of a CHARACTER (210) Examples 
of OUTLINE ELEMENTS well known in the art include straight line segments, quadratic Bezier curve 
r!^jrfcMTo"'' ^""^'^ ^'""^ segments. The CHARACTER OUTLINE (212) is a collection of OUTLINE 

bLEMENTS (200) which completely determines the boundary of the CHARACTER. A CONTOUR SEG- 
50 MENT (214) is a sequence of one or more OUTLINE ELEMENTS (200). SEGMENT ENDPOINTS are the 
endpoints of a CONTOUR SEGMENT (see for example the specific points (222) and (224) in FIG 2A) A 
SEGMENT ENDPOINT NUMBER (226) is a number assigned to each of the SEGMENT ENDPOIhJTS 
(220) in a CHARACTER OUTLINE (212) sequentially increasing in the order that they follow each other 
along the CHARACTER OUTLINE (212). 

A NORMALIZED CONTOUR SEGMENT is a CONTOUR SEGMENT described relative to fixed starting 
and ending points which are the same for all NORMALIZED CONTOUR SEGMENTS A SEGMENT 
DESCRIPTOR is the set of parameters which, in addition to the fixed starting and ending points, completely 
defines a CONTOUR SEGMENT. »h . h^eieiy 
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The following definitions are illustrated in FIG. 2B. An ASSOCIATION is a function which determines the 
position of a SEGMENT ENDPOINT. A SPATIAL ASSOCIATION is an ASSOCIATION whereby the position 
of a SEGMENT ENDPOINT is given as a function of a fixed distance, a given direction, and the location of 
another SEGMENT ENDPOINT. For exanaple. the SPATIAL ASSOCIATION (230) gives the position of 
SEGMENT END POINT (232) as a displacement by a fixed distance from the SEGMENT ENDPOINT 
(234) in the direction shown by the arrow. A NON-SPATIAL ASSOCIATION is any ASSOCIATION which is 
not a SPATIAL ASSOCIATION. 

A DIMENSION (240) is a numeric quantity corresponding to the (TYPEFACE DESIGN-dependent) fixed 
distance parameter of a SPATIAL ASSOCIATION, A CHARACTER MODEL (250) is the collection of all 
SEGMENT ENDPOINT NUMBERS (226) and ASSOCIATIONS defined for a given CHARACTER (230). It 
should be noted that a CHARACTER MODEL (250) is an abstraction of information. Different CHAR- 
ACTERS (210) can be fitted with CHARACTER OUTLINES (212). SEGMENT ENDPOINTS (220) and 
ASSOCIATIONS (230) such that they correspond to the same CHARACTER MODEL (250). 

A PIXELMAP is a (typically two-dimensional) array of elements each corresponding to one resolution 
element of the output display or recording device (referred to as a PIXEL), and each having a single value 
to be output for that resolution element by the device. A special case of a PIXELMAP is a BITMAP, wherein 
the single value assigned to each element can take on one of two values (referred to herein as "black" or 
"white"). The PIXELMAP GRID is the grid, defined by the resolution of the output device, which contains 
the PIXELMAP. 

PIXELMAP GRID ALIGNMENT is the process of adjusting a CHARACTER OUTLINE at low resolutions 
and point sizes to obtain the best fit of the CHARACTER OUTLINE to the PIXELMAP GRID when viewed 
singly and together with other characters. 

The digital output subsystem (160) of FIG. 1 is shown as a block diagram in FIG. 3. The diagram shows 
the data flow and processing carried out in the conversion process from digitally-encoded text or graphics 
(150) to the electrical data and control signals used for display output (165) or printer output (175). An 
output driver (310) first separates the streams of digital signals for text and graphics into a text stream 
(320) and a graphics stream (330). The text stream (320) is processed by a character generator (340) 
thereby producing a raster data stream (360), encoded as digital signals representing PIXELS in a 
PIXELMAP for display or printing. The graphics stream (330) is processed in a similar fashion using a 
graphics generator module (350). which produces a raster data stream (370) similar to the raster data 
stream (360) produced for the text stream (320) by the character generator (340). The raster data streams 
(360) and (370) are merged and converted into electrical signals (165) and (175) in the display/ printer 
interface module (380). 

Implementations of the output driver module (310), graphics generator module (350), and display/ 
printer interface (380) all use methods well known in the art and need no further discussion. The method 
and apparatus of the present invention are embodied within the character generator module (340). for which 
a data flow and processing diagram is shown in block form in FIG. 4. The incoming text stream (320) is 
used by an outline reconstruction module (410) to retrieve font data (420) from a font data store (430). In 
the preferred embodiment of the invention, the font data store (430) is a SCALEABLE FONT DATABASE, 
but it is noted again that a FONT DATABASE can be used as well provided the information content is 
sufficient to support the method. The outline reconstruction module (410) uses the method of the invention 
to generate CONTOUR SEGMENTS (440) for the text characters of the text stream (320). The CONTOUR 
SEGMENTS (440) are processed by a rasterization module (450) which generates their PIXELMAP 
representation and converts it ;o the serial raster output (360). 

Although the present discussion and examples given for the preferred embodiment of the invention 
assume output for rasterization in PIXELMAP form, i.e.. as a set of CONTOUR SEGMENTS the coordinates 
for which can be mapped directly to the PIXELMAP GRID by the raster output module (450), it is also 
possible to store the contour segment data (440) as a hinted outline font (470) in a standard format such as 
TRUETYPE. POSTSCRIPT, or INTELLIFONT. 

FIG. 5 summarizes the transformation of a text character from digitally encoded form to a grid-aligned 
CHARACTER OUTLINE for output as a rasterized PIXELMAP. The text stream (320) is made up of a 
sequence of digitally-encoded text characters (510) each of which is identified according to TYPEFACE 
DESIGN, CHARACTER CODE, and other information used for rendering at the desired position and size in 
the output medium. The decoded text character (512) contains a typeface design ID (514) uniquely 
identifying the typeface family, weight, style, etc.. a character ID (516) which is the CHARACTER CODE for 
the CHARACTER, and positioning and scaling data (518) referencing the CHARACTER to an output 
coordinate system. For output as a rasterized PIXELMAP. this is the PIXELMAP GRID for the output 
medium. The combination of typeface design ID (514) and character ID (516) is further decoded through 
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the use of translation tables (not shown) to obtain a set of indices (520) providing access to the font data 
(420) of FIG. 4. The typeface design index (522) permits retrieval of the TYPEFACE DESIGN-specific 
(global) data (524). The model group index (526) identifies a model group dataset (528) containing data for 
all available CHARACTER f^ODELS corresponding to the character ID (516) of the text character (510). 
The character model ind .x (530) identifies the actual CHARACTER f^ODEL (532) within the model group 
(528) to be used for the CHARACTER OUTLINE reconstruction for the text character (510). Finally, the 
character local data index (534) provides access to character local data (536) specific to the combination of 
the typeface design ID (514) and the character ID (516). 

The typeface design data (524) includes the DIMENSIONS (540) to be applied globally to the set of 
ASSOCIATIONS for all character ID'S (516) for the TYPEFACE DESIGN given by the typeface design ID 
(514). The character local data (536) contains DlfVIENSIONS for locally defined (CHARACTER fVIODEL- 
specific) ASSOCIATIONS along with the parameters for the SEGf^ENT DESCRIPTORS. The use of data 
defined locally permits customization of character shapes to an arbitrary degree. In the preferred embodi- 
ment of the invention. ASSOCIATIONS which are used in less than three CHARACTER fvlODELS are 
generally defined locally as an efficient tradeoff between local and global storage requirements. 

The global dimensions (540). together with the character local data (536) are applied to the CHAR- 
ACTER fy/lODEL (532) to produce scaled and (if so indicated in the CHARACTER MODEL data) grid-aligned 
SEGMENT ENDPOINTS (544). The CONTOUR SEGMENTS (440) are obtained by applying the SEGMENT 
DESCRIPTORS (546) obtained from the character local data (536), typeface global data (540) or fixed 
within the CHARACTER MODEL (532). 

In the event that the output of the character generator module (410) is used for generation of hinted or 
unhinted outline fonts in an industry-standard format, the CONTOUR SEGMENTS (440) are converted to 
the target outline data storage format. If the outline font is to be hinted, the CHARACTER MODEL (532) and 
the global dimensions (540) are converted to the target hint data storage format and added to the outline 
data. 

FIG. 6 shows a graphic representation of a single CHARACTER MODEL as it applies to the letter "R" in 
three different TYPEFACE DESIGNS, shown in three columns, and designated as "CG TIMES" (a 
trademark of Miles Inc.. Agfa Division). "COURIER" (not a trademark), and "ALBERTUS MEDIUM" (a 
trademark of Monotype. Inc.) respectively. The top row (610) shows the actual printed character that is to 
be stored Digitally in a scaleable. compressed form. The middle row (612) and bottom row (614) show 
some of the topological elements of the CHARACTER MODEL The horizontal components (612) and 
vertical components (614) have been separated for clarity. These components are shown in a more detailed 
view in FIG. 7 and FIG. 8 for the same CHARACTER MODEL. In FIG. 6, the ASSOCIATIONS are 
represented as solid or dashed lines which are connected in "tree" form, with the arrows indicating the 
direction of traversal through the tree structure. For the honzontal associations (612). for example a first 
node (620) has three ASSOCIATIONS (622). (624). and (626). the last of which traverses to the next node 
(628). Dashed lines indicate associations involving one or more non-grid-aligned points (e g the ASSOCI- 
ATION (630)). It is clear from FIG. 6 that although the printed CHARACTERS (610) are significantly 
different in visual appearance, the same set of associations appear in the horizontal elements (612) and 
vertical elements (614) of the CHARACTER MODEL, thereby illustrating the "knotted string" metaphor used 
previously to describe the CHARACTER MODEL in terms of a set of "shapeless loops". 

FIG. 7 shows the detailed view of the horizontal elements of the CHARACTER MODEL of FIG. 6 In the 
Figure, the positions of reconstructed SEGMENT ENDPOINTS are shown with their designated SEGMENT 
ENDPOiNT NUMBERS and are referred to in the following discussion as "point n" where n is a SEGMENT 
ENDPOINT NUMBER. For example "point 20" refers to the position of the reconstructed SEGMENT 
ENDPOINT having a designated SEGMENT ENDPOINT NUMBER equal to 20. The ASSOCIATIONS are 
indicated labeled with a symbolic name to be discussed in more detail below. SEGMENT ENDPOINTS 
connected by ASSOCIATIONS in the horizontal direction can all be reached from point 2 by following 
ASSOCIATIONS from point to point (points 0 and 1 define the character escapement (710). while points 2 
through 23 refer to actual SEGMENT ENDPOINTS on the CHARACTER OUTLINE. 

Also shown in FIG. 7 are examples of NON-SPATIAL ASSOCIATIONS. Those SEGMENT ENDPOINTS 
marked "PREV" are assumed to take on the same x-coordinate value as the previous SEGMENT 
ENDPOINT, e.g.. point 17 has the same x-coordinate as the x coordinate calculated for point 16 The 
SEGMENT ENDPOINTS marked "ADJ" take on an x-coordinate value determined by the ratio of the 
reconstructed x-coordinates of the points before and after along the CHARACTER OUTLINE For example 
the x-coord.nate of point 22 is determined by the fraction of the horizontal distance between points 23 and 
21, as indicated by the parameter stored for the "ADJ" ASSOCIATION. 
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TYpSlcTnSrNT' T"" f SOCIATIONS refer to either global DIMENSIONS (defined on a 

TYPEFACE DESIGN basis), or local DIMENSIONS (defined only for this CHARACTER MODEL) For 
f^ W Lr'°';VT 9'^^" ^° ASSOCIATION (720) Is a mnemonic name standLg 

s ACTErT uch asT "H-'etc'rH"' ^° ♦•^^ -^''^ °' char' 

• • ' ^"'^ "5®"^ ^'^^ ^''"es in the "R" model of FIG. 7 for the ASSOClATintM<; 
(720). (722) and (724). The DIMENSION corresponding to UCSLM, is sSed on ly o^ce ?n a cSal 
dimension table within the TYPEFACE DESIGN-specific data ((524) and (540) of Rg V 

CHARACTErandT. Z.TTT^ ' ""'"'"^ ASSOCIATION within a single 

w names "LOCAL " locaT? 'l ^'^'^'f" ^^'^ that CHARACTER. They are given the symbolic 
"LoS1l9'- ' ■ ASSOCIATION (730) having the symbolic name 

local'DIM™^'f ^^.'f CHARACTER MODEL, whether assigned a global or 

d-::.irin na 5; o^h: z ;c^r ^^'-^ °^ ^--^^ --^^-^^ 

" DF^^^M been described previously, the actual values of the DIMENSIONS for a particular TYPEFACE 
DESIGN are used to measure the distances between SEGMENT ENDPOINTS. These values can be scaled 
so as to produce the desired character size relative to a given PIXELMAP GRID (delrm!ned by S 

hos'e ?EG^ENf L^dJS^^^^ ''''''' ^^'^'^'^^^ coordinate valueVfo 

tnose SEGMENT ENDPOINTS to be used for the purpose of PIXELMAP GRID ALIGNMENT can be 

Z f ° ""'^ *° -•^^ther or not a given SEGMENT ENDPOlNT s to 

be gnd aligned ,s one of the parameters of the ASSOCIATIONS contained in the CHARACTER MODEL 
following rps'"' reconstruction of the horizontal coordinates is summarized in the 

1. The starting x-coordinate is taken to be point 0 and its value is set to 0 

foca?d^trsLTand°ldT.T';;'''' ^^^^^'^^'^^^ (^^0) is retrieved from the character 

rnn l J. T T f ^° '^-'^""^^'"^t^ °' P°i"t 0 to Obtain the x-coordinate of point 2 The x- 
ASSOaATION^ ' "''^'^^^ -"^^ a parameter of th^t 

3^ The DIMENSION for the SPATIAL ASSOCIATION "UCSLMV (720) is retrieved from the DIMENSION 
Ob .n S"" ° T ^^T^""^ ^""^'^^^ted from the x-coordl e of pdmTto 

T n c, 'Z'': °' ' ^'^^L^'^P GRID ALIGNED if called for) As described 

:;r,;;LirA?TErSE"L °' ^ ^^^^-'-^ - Oli^ENSr s-g" 

:::Ltde=^rrsra::^^^^^^^^^ 

L^e's'el' m?>fc' h''°'''"'i^' ASSOCIATIONS have been located according to the 

above s.eps, the x-coordinate values of all SEGMENT ENDPOINTS labeled "PREV are set to th^y 
coordinates of the previous points on the contour (e.g. the x-coordinate value of point 3 it set to tha, S 

6. Finally, the x-coordinate values of all SEGMENT ENDPOINTS labeled "ADJ" takP nn th w . 

SEGMENT ENDPOINTS previous and following along the contour. That fraction value is retrieved ror^ 

^iTTssociAToN;;::)'""^''^" - ^ - fractionra.uere;io;r 

G Jit tT^^V^'l equivalent detailed view of the vertical elements of the CHARACTER MODEL of FIG 

e ca a nV:,;Vd a?ac^ ''''' ''''^ ^^--^ standard fontwiS 

appiicabfe-rorgTverc^AR^^^^^^^^^^^ 

determine .he y-coordinates of each SEGMENT ENDPOINT^s ^IgortoTat descr b^^^ 
rcrec;:d°tr"s.:ad r r^'^'^^ '''' association netwl^ for the lertiral d^c^^n ee o 
coo dinaL For txlmo^^ °' ' ^'"^'"'^ °' sub-networks, each with its own starting y- 

-nH T=f< P • ^ '''"^'"9 for points 11 and 17 is the value of line BASELINE f8l2^ 

end that for point 5 is the value of the line CAPHFifiHT /«ift» Th» .■ B«:>tuiNt (812). 

.racio.. Of the vertical distance betleerBASEUN^(812rani CAPHBGrSrorr^^ f" 1' '? 
dimension value LOCALS (814). OAPHEIGHT (810) as defined by the local 

FIG. 9 shows a representative set of NORMA' IZED rnNxni ir <;cr^<CMT . 

rdiLr^^rTr, "iTth?r ^^^~Tmr^^7;;::;:^ 

coordinate pair (1,1). In the Figure, each type is shown with the start and end coordinate points together 
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^: ;etrj: rsr^*" 

' l^Z::, ' ""'^"""^ ' '™ pen, an» 

referred lo as Ihe control poini ihe OUAO ^ ' " 

throusn Ihe N-nh „rai9h, line of "he si,"e„c" """^"'"Sl *« "I Ihe firs, 

n„mh.r of^La, .^p.^'r.!!^,': z^z^ z:'^^:!:,^ " 

?5 AS defined previously, a NORMALIZED CONTOUR QPrK^icMT • ^« "erinea. 

^^^^^^^^^^ 

"0,™,. s,ore. ,n a „i„i.„r:, ^ Sr^fsletZr °' ' ^^^^^^ 

. ENOPOINTS .« p/lm"17™di,l,» ^ ' P.^ '^^"^"^ '^"^ SEGMENT 

NORMALIZED CONTOUR SELEr ,.rj r c^^^^^^ ™ '"^ 

,he Bezie, coniro, poims „, . cubic Be. er c"! be c - ,0 , -T" ""eeponding ,o 

. POin. (X', V) in ,he oulpirt coorainaTspa "e can to obfll:, , ' ' °* ™' '"'"'"''K »' 

poin, (X, y, in ,„e „„™,fe,a ,p,„7 sca^not^n^S , • °"«W"«"9 coordinales o, ,he 

: ,wo SEGMENT EN0PO,NTS P' and p' .Bh . !, ^ "analormaton: for a CONTOUR SEGMENT bef.een 
U, ,he NORMAUZED CONTOUR SEa:ENTl „rrrcVBlrald'rL~ 

=-»,eso,,ber,,^;viV;etrs.idrt 
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Given the start and endpoints P'-. and P'2 and the two control points C, and d. the exact Bezier 
curve can be rendered according to procedures well known in the art. 

The ability of the method to replicate the results of traditional scaleable font storage and retrieval 
methods is shown in FIG. 11 in which the same set of characters are rendered for a number of point sizes 
in two standard TYPEFACE DESIGNS, using the method of the invention (left column) and the standaro 
INTELLIFONT scaleable font database (right column). The too left set (1110) is the output for the CG- 
riMES TYPEFACE DESIGN as stored, retrieved, and rendered using the method and apparatus of the 
preferred embodiment described herein. The top right comparison set (1112) was obtained from data 
stored, retrieved, and rendered using the standard INTELLIFONT system. The lower left set (1114) and 
lower right set (1116) are the corresponding output results for the CG-TIMES ITALIC TYPEFACE DESIGN 

The data compression resulting from the method of the invention results from two important effects (1) 
the ability to re-use the CHARACTER IVIODEL; and, (2) the use of one set of contour points, ie the 
SEGMENT ENDPOINTS, to servo the dual purpose of outline reconstruction and hint-based PIXELMAP 
GRID ALIGNfvlENT. FIG. 12 shows an example of a model group comprising the set of CHARACTER 
MODELS used for various TYPEFACE DESIGNS for the letter "H" (having ASCII CHARACTER CODE = ' 
072). In the figure, the "ALBERTUS" TYPEFACE DESIGNS are trademarks of Monotype. Inc "MARI- 
GOLD" of Alpha Omega, Inc., "UNIVERS" of Linotype AG and/or its subsidiaries, "ANTIQUE OLIVE" of 
Olive. Inc., and "CG OMEGA" and "CG TIMES" of Miles Inc., Agfa Division. All other TYPEFACE DESIGNS 
have names which are in the public domain. 

Within the model group of FIG. 12, a total of 12 CHARACTER MODELS are needed to reconstruct the 
lette,' "H" for the 35 TYPEFACE DESIGNS shown. The CHARACTER MODELS with model index 1 and 2 
(designated as "Model 1" and "Model 2" respectively in FIG. 12) together represent 15 TYPEFACE 
DtSIGNS. A different model group exists for each text graphic object uniquely identified by a CHARACTER 
CODE, and the number of CHARACTER MODELS required to span the full set of TYPEFACE DESIGNS 
depends upon the variability of the shape of the object. As an example, a minimum of four CHARACTER 
MODELS are needed to represent the lower case letter "a" (ASCII CHARACTER CODE = 097) in both 
single story" and "double story" Roman and Italic forms. For the letter "0" (ASCII CHARACTER CODE = 
079) however, only three CHARACTER MODELS are required to represent the same 35 typeface designs 
Shown in FIG. 12. /r- y 

r^u.^n^^'^L''^ ^V^^ discussion, the average ratio of TYPEFACE DESIGNS to the number of distinct 
CHARACTER MODELS required to reconstruct them is about three. Re-use of the contour point data yields 
anotner factor of two. leading to an average compression ratio of approximately six as a direct result of 
using the method. Additional data storage savings are achieved in the preferred embodiment of the 
invention by using other content compression techniques well known in the art, e.g., using single-byte index 
values to represent the ASSOCIATIONS, for which DIMENSION data values are stored to higher precision 
only once for each TYPEFACE DESIGN. The final result for the preferred embodiment of the invention is 
indicated by a storage requirement of approximately 500 Kbytes for a collection of 45 TYFEFACE DESIGNS 
rvtlli^rV^L?f^°'^- 3PP^°><i"^ately 3 MBytes of storage required for the same collection of 
TYFEFACE DESIGNS using the standard INTELLIFONT scaleable font database methodology, therefore 
yielding an effective ratio of about 6:1. 

The preferred embodiment shown in the Figures and described above makes use of a one-to-one 
relationship between CHARACTERS and CHARACTER MODELS, i.e., individual CHARACTERS are con- 
structed each from a single CHARACTER MODEL using procedures as previously described, It is also 

^Mfp^PTPn Mont""""^"""^ ^^'P^'" ^-S- c°"^P°^"d characters, by use of more than one 

OHAHAUibR MODEL in combination. For example, the addition of a diacritical mark to a CHARACTER 
(such as an umlaut to the letter "U") yields a distinct CHARACTER with a unique CHARACTER CODE in a 
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CHARACTER SET. The new CHARACTER is otherwise identical to that without the diacritical mark The 
generation of the umlauted CHARACTER is thus accomplished using the CHARACTER MODEL of the non- 
umlauted CHARACTER together with a CHARACTER MODEL for the umlaut alone. As used herein the 
'^frutlT-r^^T^^I^^ SUB-MODEL shall mean a CHARACTER MODEL in a hierarchical combination 
of CHARACTER MODELS which together describes a complete CHARACTER. This hierarchical combina- 
tion IS called out in the decoding of the text character (510) of FIG. 5 based upon the character ID (516) 
and the typeface design ID (514), with the addition of ASSOCIATIONS used for transformation (relative 
positioning, scaling, rotation, etc.) of the individual CHARACTER SUB-MODELS. Although combination is 
used in the descnbed preferred embodiment only for compound characters such as diacritical characters 
there is no limit to the number of CHARACTER SUB-MODELS or the number of hierarchical levels with 
which this combination can take place. For example, it is feasible, through the use of CHARACTER SUB- 
MODELS, to construct CHARACTERS entirely from components, thereby achieving further content com- 
pression. While this approach is useful for the Latin TYPEFACE DESIGNS, it produces significant results in 
those Asian languages wherein a large number of characters are built up of combinations of "radicals- 
each combination of which is represented by a CHARACTER SUB-MODEL. Each such CHARACTER SUB- 
MODEL is independently transformed (i.e., positioned, non-linearly scaled, rotated, etc.) and combined with 
others to produce a single CHARACTER. 

The primary objective of the method of the invention is efficient storage and retrieval of data 
representing a scaleable font, providing for scaled character output without loss of accuracy with respect to 
any TYPEFACE DESIGN stored in the database. In addition, new TYPEFACE DESIGNS can be created by 
systematic modification of the parameters associated with one or more TYPEFACE DESIGNS already 
stored, thereby synthesizing a new font. For example, consider the ASSOCIATION UCXSM1 (742) of the 
character shown in FIG. 7, representing the upper-case vertical stem width. Scaling the DIMENSIONS of 
this and related ASSOCIATIONS for a single TYPEFACE DESIGN by a factor of 1.2 would create a new 
design for which some horizontal measurements have been broadened by 20% while others remain 
unchanged. Interpolation between DIMENSIONS corresponding to the same ASSOCIATIONS of two 
TYPEFACE DESIGNS, such as for example bold or non-bold versions of a TYPEFACE DESIGN would have 
the effect of creating a "semi-bold" version of the same TYPEFACE DESIGN No changes to the 
processing methods are involved other than pre-processing of the typeface global data used for the 
character generation. 

Just as new characters and fonts can be synthesized by systematic manipulation of DIMENSIONS 
Sim, ar procedures can be applied to SEGMENT DESCRIPTOR data. Altering the positions of the control 
points of a CUBIC, for example, would create a different shape for the CONTOUR SEGMENT reconstructed 

n.TTpfoi^^ °^ invention is data-driven, the procedure for constructing a SCALEABLE FONT 

DATABASE to be used with the preferred embodiment of the invention is achieved through the definition of 
da.a sets to be used by the outline reconstruction module (410) of FIG. 4, using the understanding of the 
art of typeface design, the rules for defining the ASSOCIATIONS, and the rules for entry and formatting of 
the data. These rules are based upon the TYPEFACE DESIGN art. along with an understanding of the 

jZTZrt nfJT'V'' CHARACTERS for an entirely new 

TYPEFACfc DESIGN can be created and added to an existing font collection without any other changes to 
he system. Similarly, entirely new CHARACTERS can be created for existing TYPEFACE DESIGNS 
through the assignment of valid CHARACTER CODES and updating the TYPEFACE DESIGN datasets as 

HGcessary . 

The processing steps for construction of a new or augmented SCALEABLE FONT DATABASE are 

rcrErBLrprl'rnlT.lcf '''' °' SCALEABLE CHARACTERS to be used in a new 
!n M^nm / I DATABASE or added to an existing SCALEABLE FONT DATABASE is processed in a 
loop (1300) described as follows. 

r^^J^^^^^^^^ CHARACTER in sequence, the text character is first selected (Box (1310)) and 
^en ified with respect to its CHARACTER CODE within a chosen CHARACTER SET. A CHARACTER 
M.i,u , " C^^^f^ACTER using well known automated or manual digitizing techniques (Box 

1312)). In many cases, the CHARACTER already exists in a suitable digital format (e.g.. as a dataset 
formatted by the IKARUS typeface design system). g . as a aaiaset 

,h. nl" 'ru^L'^rrlT^r. ^"^^^^CTER MODELS is visually scanned (Box (1314)) for applicability to 

CHARACTER SS?L,NF.°'".".T " '''' ^''''^ '''' °^ ^^-^^^^ ''-^^'ers' or 

ACTFR JnnP. ^'^'^'^^'^^^ combination as previously described, existing CHAR- 

ACTER MODELS and/or CHARACTER SUB-MODELS are considered as candidates for such a combination 
Ha match is found, the CHARACTER MODEL or combination of CHARACTER SUB-MODELS is sSed as 
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the CHARACTER MODEL for the new CHARACTER OUTLINE fBox M320U . u ■ , 
existing CHARACTER MODEL or combination of SaRACtS^ SufS^^^^^ 

local data. Finally, the SEGMEwf DeStORS ' ^^.^'^'^^ '"^^ ^^^^^^^er 

'0 applied to each CONTOUR SEGMENTTth^HA^^^^^^ ''""'^^ °' ^ P-edure 

.e i^o^T^jo, ™?n;^ iii:^:!^™^^^^^^^^^^^ rr " 

_ If this is a new SCALEABLE FONT DATABASE or a newTlEFACE D^i^^^^ '''''''' 
font database (Box (1370)) the global dimensions for the ^pe^ACe SeilGf^f h'". ''"'"^ 
5 according to procedures described below. Othe^e the nef.r^LKf . determined (Box (1372)) 
(Box (1374)) by storing, retrieving and reconr,rL fh 1^^ characters are tested for quality 

Visually comparing the — t'ed ^^LTsXr J^^^^^^^^ 

manually tuned in the dataset as necessarv tr. =oh>, « ^"'^"'^^l^cR OUTLINE. Individual values are 

newly created scaleable cha^L ,,e .ata llr^H , .h ^"^^ °' the 

later retrieval. '^'^a^acters, the data entered for them ,s formatted (Box (1380)) for storage and 

The global dimensions for the TYPEFACE nF<?ifiw ar= ■ ^ ,r, 

recording al. global dimensions called oC. by fh' c^^^^^^^^ 

scaleable characters belonging to the TYPEFACE DESIGN ^ "'^'^ "^^ °' 

to the same identified ASSOCIATION aoDearfnn m«n ? Measurements of DIMENSIONS corresponding 
njeasurements correspond.n^g' oTelrS ^73^:^^;^^^ 

slightly different values. Accordinqlv the averanp w«imI I . ^ ®* 9^"e^a"y yield a variety of 
.0 be verified and; or manually tunt' rinTStyr^Xrs^^ '^''''"^ ^ 

procid^s t^z^z SfE^rii^r^^"^^"^"^ '° - — 

SEGMENT END POINTS are located precisX and inln I f generated. From this representation. 
ASSOCIATIONS are defined reSing thrSEGMENT EnT^^ '^^'^'"^ 
FIG. 8, and this information is recorded in the CHARACTPrM^ IT '''I' ^ 
labeled according to their type. SPATIAL TsSciA^^? ASSOCIATIONS are 

"LOCALS (740) or "UCSLmT (720) ^FIG NO^SPAT AlSoc.A^^^^^^^^ " "^'"^ '^'^ 

the function to be applied (e.g. "PREV" 'Vj- p^I "^^"^'""^ ="°^d'"9 to 

established, including an indi'cation of whether o n t SmTp GmrAU^ ASSOCIATION are 

application of the ASSOCIATION to determine its cnrrP^nnnH / ALIGNMENT is to take place in 
This information is entered as data LTeh™. , !^ SEGMENT ENDPOINT location 

Having described !n detL a orefri.' '^TT^ ^^""''^^^ °' the new CHARACTER MODEL 
Skilled ,n L art .hat n^^ ^^^^ °- 'mention, it will now be apparent to those 

invention as defined in the following Ss '"^'^'^^ ^^°P^ °' the 



Claims 
1. 



A method for construction of a SCALEABLE FONT DATABA<5F idtn^ . ■ • u 

(A) Selecting a CHARACTER (210) from a set of nnf. («0) co^^PHSmg the following steps: 
said SCALEABLE FONT DATABASE («0)- CHARACTERS (210) to be stored in 

fcl PrlT"^ ^CHARACTER CODE (516) to said CHARACTER (210)- 
C Creating a CHARACTER OUTLINE (212) for said CHARACTER (210)- 
(D) Determining SEGMENT ENDPOImtq /oonx • r^.,.S J ^ 
CHARACTER ,1,0, .s>,rJ"^ZlTE?Sjrum ,^^^^^ <^°' 

ACTErSlM/so?"'"''""' --^«in;ASSOC,AT,ONS ,„ CHAR- 

C^tcVTR o'uur." a^r'"'"""' ^'^"^ »' 
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(H) Determining global DIMENSIONS (240) applicable to said set of one or more CHARACTERS 
(210); and. 

(I) Formatting said CHARACTER CODE (516). CHARACTER MODEL (250), local DIMENSIONS 
(240). and SEGMENT DESCRIPTORS (546) for each of said CHARACTERS (210) together with 
said global DIMENSIONS (240) to form STORED SCALEABLE CHARACTERS (510) comprisinq 
said SCALEABLE FONT DATABASE (430). ^ 

2. A method for generating STORED SCALEABLE CHARACTERS (510) representing one or more 
CHARACTERS (210), said STORED SCALEABLE CHARACTERS (510) to be included in a previously 
constructed SCALEABLE FONT DATABASE (430), said method comprising the following steps- 

(A) Selecting a CHARACTER (210) from said set of one or more CHARACTERS (210)- 

(B) Assigning a CHARACTER CODE (516) to said CHARACTER (210); 

(C) Creating a CHARACTER OUTLINE (212) for said CHARACTER (510)- 

(D) Determining SEGMENT ENDPOINTS (220) in a CHARACTER MODEL (250) obtained for said 
CHARACTER (210) using said CHARACTER OUTLINE (212); 

(E) Measuring local DIMENSIONS (240) for corresponding ASSOCIATIONS (230) in said CHAR- 
ACTER MODEL (250); 

(F) Determining SEGMENT DESCRIPTORS (646) for each CONTOUR SEGMENT (214) of said 
character outline; 

(G) Repeating steps (A) through (F) for any remaining CHARACTERS (210) of said set of one or 
more CHARACTERS (210); 

(H) Determining global DIMENSIONS (240) applicable to said set of one or more CHARACTERS 
(210): and, 

(I) Formatting said CHARACTER CODE (516), CHARACTER MODEL (250), local DIMENSIONS 
(240), and SEGMENT DESCRIPTORS (546) for each of said CHARACTERS (210) together with 
said global DIMENSIONS (240) to form said STORED SCALEABLE CHARACTERS (510) to be 
included in said previously-constructed SCALEABLE FONT DATABASE (430). 

A method for generating a STORED SCALEABLE CHARACTER (210) comprising tho following steps- 

(A) Assigning a CHARACTER CODE (516) to a CHARACTER (210); 

(B) Creating a CHARACTER OUTLINE (212) for said CHARACTER (210)- 

(0) Determining SEGMENT ENDPOINTS (220) in a CHARACTER MODEL (250) obtained for said 
CHARACTER (210) using said CHARACTER OUTLINE (212); 

(D) Measuring local DIMENSIONS (240) for corresponding ASSOCIATIONS (230) in said CHAR- 
ACTER MODEL (250); 

CHA^AcTeR OUTLINE^ DESCRIPTORS (546) for each CONTOUR SEGMENT (214) of said 
(F) Determining global DIMENSIONS (240) applicable to said CHARACTER (210)- and 
?JnTTjJ^'"^ CHARACTER CODE (516), CHARACTER MODEL (250). local DIMENSIONS 
mM m^.'^hmI f f (546) for said CHARACTER (210) together with said global 

DIMENSIONS to form said STORED SCALEABLE CHARACTER (510), 

^^.o.^™"!''^'""' °' ^ ""'"^ CHARACTER MODEL (250) comprises a plurality of 

CHARACTER SUB-MODELS, together with ASSOCIATIONS (230) defining an inter-relationship of said 
plurality of CHARACTER SUB-MODELS. 

Tne method of claims 1., 2., or 3. wherein at least one of the CHARACTER MODELS (250) is 

?HlRtcTERTonpPr,^=n?'^°^'''^' ^'""^ ''''''''' ' °' previously'created 
n, iTMMc '^'^ ''^'"S '=^5^^ "P°" comparison of said CHARACTER 

uu I LINE (212) with each of said set of previously-created CHARACTER MODELS (250). 

The method of claims 1., 2., or 3. wherein at least one of the CHARACTER MODELS (250) is 
.'nu.'n^Tr: ' ^"^^^^C^ER OUTLINE (212) by creating a new CHARACTER MODEL (250) using 
said CHARACTER OUTLINE (212). ^ 

The nnethod of claims 1.. 2., or 3. wherein at least one of the global DIMENSIONS (240) is determined 
by selection from a previously-created set of measurements for global DIMENSIONS (240). 
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8. The method of claims 1 ,. 2., or 3. wherein at least one of the global DIMENSIONS (240) is determined 
by creating a new measurement for one or more global DIMENSIONS (240). aetermined 

follolTs/eps""'"'" °' " CHARACTER MODEL (250) for a CHARACTER (210) comprising the 
(A) Generating a representation of a CHARACTER OUTLINE (212) for said CHARACTfr 
(BUocating SEGMENT ENDPOINTS (220) on said represental^:! C^aSaJ^ 

In! !coo!°'^^'^^ ENDPOINTS (220) with SEGMENT ENDPOINT NUMBERS (226)' 

(D) Identifying ASSOCIATIONS (230) relating said SEGMENT ENDPOINTS (220) 

(E) Determining parameters for each of said ASSOCIATIONS (230)- and 

Taid^'cHARAcSR ^J.nw ^^'^ ASSOCIATIONS (230) to form 

saia UMARACTER MODEL (250) for said CHARACTER (210). 

CHArSeR f2?0?S''m.tl' ^"^"^^-^E" SUB-MODEL representative of a component of a 
UMAHAO I £R (210), said method comprising the following steps- 

SJA?AcTER^(2?or''''"'''°" °' ' CHARACTER OUTLINE (212) for said component of said 

(B) ^Locating SEGMENT ENDPOINTS (220) on said representation of said CHARACTER OUTLINE 

(C) Identifying said SEGMENT ENDPOINTS (220) with SEGMENT ENDPOINT NUMBERS (226)- 
D Identifying ASSOCIATIONS (230) relating said SEGMENT ENDPOINTS (220)- 

(E) Determining parameters for each of said ASSOCIATIONS (230)- and 

rjd'cHARAcSR SUB f ^'''^ ASSOCIATIONS (230) to form 

saia UMAHACTER SUB-MODEL for said component of said CHARACTER (430). 

follo:!nTst'eps'°"'"'°" °' ' CHARACTER MODEL (250) for a CHARACTER (210) comprising the 

(A) Generating a representation of a CHARACTER OUTLINE (212) for said CHARACTFR f210V 

(B) seating SEGMENT ENDPOINTS (220) on said representation of Lid Z^aJtEr' OUTLINE 

(C) Identifying said SEGMENT ENDPOINTS (220) with SEGMENT ENDPOINT NUMBERS (226)- 
D Identifying ASSOCIATIONS (230) relating said SEGMENT ENDPOINTS (220) 

^^^S^S^^"^ ^-"-^'^ -^-^ °' - --OINTS 
(F) Determining ASSOCIATIONS relating said CHARACTER SUB-MODELS- and 

?HARA?TlR'sUBMODEr^?'f" ^UB-MODELS and said ASSOCIATIONS ' (230, relating said 
CHARACTER SUB-MODELS to form said CHARACTER MODEL (250) for said CHARACTER (210). 

' ^HARACTERslslni"'"^ ' ^^^^f^SION (240) applicable to a plurality of STORED SCALEABLE 

^ZZij:^':::^.^:^'^ --^^-^ - - — ce de^ 

(A) Selecting a TYPEFACE DESIGN (514) from said one or more TYPEFACE DESIGNS (514)- 
B, Selecting a CHARACTER (210, belonging to the TYPEFACE DESIGN (5147selerd in step (A) 
e oon^n'T' "T\ P'^^^SIONS (240) for each occurrence of an ASsiclT ON (230? co - 
AcTeR (210)° ' ' ' CHARACTER MODEL (250) of said CHAR- 

^YPEFrE'DlsS^i'; ^^^""'"^ CHARACTERS (210) belonging to said ■ 

(E) Forming a candidate global DIMENSION (240) as an average value of the measurements of step 

(F) Ad.iusting said candidate global DIMENSION (240) to obtain the closest fit for all CHARACTERS 
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Fig. 2A Character Outline Defined Terms 
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Fig. 2B Character Model Defined Terms 
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Fig. 7 
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Fig. 8 
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Fig. lOA. NormalizedCubic Bezier Curve 
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Fig. lOB. Reconstructed Outline Segment 
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